A Novel Robust MFCC Extraction Method Using Sample-ISOMAP for Speech Recognition
نویسندگان
چکیده
According to the nonlinear characteristic of the speech signal, this paper presents a novel robust MFCC extraction method using sample-ISOMAP. ISOMAP is a nonlinear dimensionality reduction method based on the theory of manifold, it can reveal the meaningful low-dimensional structure hidden in the high-dimensional observations. In the proposed method, ISOMAP is first applied for calculating the non-linear mapping matrix which comes from the consistency mixed matrix. The consistency mixed matrix is composed of the logarithm of Mel filter bank energies derived from the sample data. Then the non-linear mapping matrix is used to replace the DCT procedure in the classic MFCC method. Experiments based on the recognition system established by HTK3.3 and Aurora2.0 speech database show that the robustness of the proposed method is superior to the PCA-MFCC and MFCC methods, and the recognition rate has been notably raised under low SNRs.
منابع مشابه
Improving the performance of MFCC for Persian robust speech recognition
The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...
متن کاملEnvironment Independent Speech Recognition System using MFCC (Mel-frequency cepstral coefficient)
Speech recognition is a method of finding similarity between two sequences. Various researches have been done on it. In our research, we are trying to achieve the optimal accuracy during the recognition procedure. Here, we are extracting features of the voice sample before filtering it through a noise reduction filter. For each individual, there are number of features are taken using feature ex...
متن کاملRobust Speech Feature Extraction Using the Hilbert Transform Spectrum Estimation Method
The performance of traditional mel-frequency cepstral coefficients (MFCC) speech feature extraction method decreases drastically in the complex noisy environment. To improve the performance and robustness of speech recognition system, which is based on spectral envelope estimation method, the minimum distortionless response spectrum MVDR-MFCC (Minimum Variance Distortionless Response-MFCC) feat...
متن کاملNew Features Using Robust MVDR Spectrum of Filtered Autocorrelation Sequence for Robust Speech Recognition
This paper presents a novel noise-robust feature extraction method for speech recognition using the robust perceptual minimum variance distortionless response (MVDR) spectrum of temporally filtered autocorrelation sequence. The perceptual MVDR spectrum of the filtered short-time autocorrelation sequence can reduce the effects of residue of the nonstationary additive noise which remains after fi...
متن کاملA Robust Front-End Processor combining Mel Frequency Cepstral Coefficient and Sub-band Spectral Centroid Histogram methods for Automatic Speech Recognition
Environmental robustness is an important area of research in speech recognition. Mismatch between trained speech models and actual speech to be recognized is due to factors like background noise. It can cause severe degradation in the accuracy of recognizers which are based on commonly used features like mel-frequency cepstral co-efficient (MFCC) and linear predictive coding (LPC). It is well u...
متن کامل